Graph-based data selection for the construction

Authors

  • Steven Maenhout
  • Bernard De Baets
  • Geert Haesaert
Abstract

Efficient genomic selection in animals or crops requires the accurate prediction of the agronomic performance of individuals from their high-density molecular marker profiles. Using a training data set that contains the genotypic and phenotypic information of a large number of individuals, each marker or marker allele is associated with an estimated effect on the trait under study. These estimated marker effects are subsequently used for making predictions on individuals for which no phenotypic records are available. As most plant and animal breeding programs are currently still phenotype-driven, the continuously expanding collection of phenotypic records can only be used to construct a genomic prediction model if a dense molecular marker fingerprint is available for each phenotyped individual. However, as the genotyping budget is generally limited, the genomic prediction model can only be constructed using a subset of the tested individuals and possibly a genome-covering subset of the molecular markers. In this paper, we demonstrate how an optimal selection of individuals can be made with respect to the quality of their available phenotypic data. We also demonstrate how the total number of molecular markers can be reduced while a maximum genome coverage is ensured. The third selection problem we tackle is specific to the construction of a genomic prediction model for a hybrid breeding program where only molecular marker fingerprints of the homozygous parents are available. We show how to identify the set of parental inbred lines of a predefined size which has produced the highest number of progeny. These three selection approaches are put into practice in a simulation study where we demonstrate how the trade-off between sample size and sample quality affects the prediction accuracy of genomic prediction models for hybrid maize.
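The third selection problem in the abstract, choosing a fixed number of parental inbred lines so that as many hybrids as possible have both parents in the chosen set, can be pictured as a graph problem: parents are nodes and each hybrid is an edge. The sketch below illustrates that framing with a simple greedy peeling heuristic. It is not the authors' algorithm, which the abstract does not describe, and it does not guarantee an optimal set. The function name `select_parents`, the input format (a list of parent pairs) and the example data are assumptions made for illustration only.

```python
from collections import defaultdict


def select_parents(hybrids, k):
    """Greedy heuristic (an illustrative assumption, not the paper's method).

    hybrids: iterable of (parent1, parent2) pairs, one per hybrid.
    k: number of parental lines to keep.
    Returns (selected_parents, covered), where covered is the number of
    hybrids whose two parents are both in the selected set.
    """
    # Count hybrids per unordered parent pair; skip pairs with identical parents.
    weight = defaultdict(int)
    for p1, p2 in hybrids:
        if p1 != p2:
            weight[frozenset((p1, p2))] += 1

    # Build a weighted adjacency map: parent -> {other parent: hybrid count}.
    neighbours = defaultdict(dict)
    for pair, w in weight.items():
        a, b = tuple(pair)
        neighbours[a][b] = w
        neighbours[b][a] = w

    remaining = set(neighbours)

    def degree(parent):
        # Number of hybrids this parent shares with parents still in the set.
        return sum(w for q, w in neighbours[parent].items() if q in remaining)

    # Repeatedly drop the parent contributing the fewest hybrids.
    # Sorting makes tie-breaking deterministic.
    while len(remaining) > k:
        remaining.remove(min(sorted(remaining), key=degree))

    covered = sum(w for pair, w in weight.items() if pair <= remaining)
    return remaining, covered


# Hypothetical example: five inbred lines and five hybrids.
example = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"), ("D", "E")]
parents, covered = select_parents(example, 3)
print(sorted(parents), covered)  # ['A', 'B', 'C'] 3
```

Peeling off the least-connected parent is cheap and easy to follow, but it can miss the best set on larger crossing schemes; an exact or better-tuned search would be needed when optimality matters.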





Publication date: 2010